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Abstract 

^ ' We explain the physical basis of a model for small globular proteins with 

^— ^ , water interactions. The water is supposed to access the protein interior in 

an "all-or-none" manner during the unfolding of the protein chain. As a 
consequence of this mechanism (somewhat speculative), the model exhibits 
fundamental aspects of protein thermodynamics, as cold, and warm unfolding 
' of the polypeptide chain, and hence decreasing the temperature below the 

I cold unfolding the protein folds again, accordingly the heat capacity has three 

characteristic peaks. The cold and warm unfolding has a sharpness close to a 
' two-state system, while the cold folding is a transition where the intermediate 

states in the folding is energetical close to the folded/unfolded states, yielding 
a less sharp transition. The entropy of the protein chain causes both the cold 
' folding and the warm unfolding. 

a ■ 

Q ; PACS: 05.70.Jk, 87.14.Ee, 87.15. Co, 87.10.+e 



1 Introduction 



' In order to have a precise function in the biological "machinery", it is important 



for proteins to have an unique conformation at physiological temperatures. This 
is termed the native state. Anfinscn ^ proved in his famous experiment with 
ribonuclease the important fact that the folding of the polypeptide chain is ther- 
modynamically determined. 

One simplified view of protein folding is that the protein is supposed to follow 
a specific folding pathway of conformations in a descending landscape of Gibbs free 
energy . This is a picture of a folding protein that is forced to follow a specific 
"path" of successive conformational steps of increasing structural order. We will 
use this pathway assumption in this paper. 

A protein in physiological environments (pH, ionic strength etc.), and tempera- 
tures is packed in a very compact way. It is then termed folded. An increase of the 
temperature will eventually denaturate the protein, i.e. it unfolds. Other ways to 
unfold the protein are for instance to change the pressure, denaturant concentration 
or the pH. The fact that proteins also unfold at low temperatures, termed as cold 
unfolding ||l^,|ri|, makes the system very unusual. A major difficulty in experi- 
ments of cold unfolding is that the temperature is around and below the freezing 
point of water. In a frozen aqueous solution, one cannot observe any conformational 
transitions |l^ ]. 

^Permanent Address: Department of Physics, Norwegian University of Science and Technology, 
NTNU, N-7491 Trondheim, Norway 



1 



A general feature of small globular proteins is that they thermodynamically seems 
to unfold in an "all-or-none" manner. This means that they unfold cooperatively 
without noticeable intermediates |p"3[~jl8[|, with a deviation from a two-state sys- 
tem not exceeding 5%. The deviation from a single macroscopic system can be 
explained by presence of unstable intermediates ||, |l^, ^ . It is worth noting that 
all these experiments have been done only for the warm unfolding. The occurrence 
of intermediate states in larger proteins P,|^,p^] is not a contradiction to the two- 
state behavior in the experiments in Refs. |1^18|, because the latter only considers 
small globular proteins. 

The van't Hoff enthalpy relation (for heat of reaction) ||l^,[2l|| 

AC 

AH = akBT^— , (!) 

is a powerful way to quantify the sharpness of a smoothed out first order phase 
transition. As shown in Fig.|^, Tc is the transition temperature (at the middle 
of the peak), Q, which is the same as AH (no pressure), is the released energy 
(latent heat), and AC is the peak height of the transition, a is a dimensionless 
proportionality factor. For a given AH and Q, then the value of a is inversely 
proportional to AC. In this respect a smaller a corresponds to a sharper transition. 



In this article we will explain the physical basis of a protein model, that refor- 
mulates the water interactions proposed in earlier models by Hansen et al. ^ 
and Bakk et al. |,|. We wiU compare thermodynamical quantities, as the heat 
capacity, to experiments. The protein is also investigated in a temperature region 
below accessible experimental data. 



2 Modeling the protein 
2.1 The polypeptide chain 

The polypeptide chain is modeled as in earlier articles by Hansen et al. and 
Bakk et al. ||,^, where the protein is supposed to follow a pathway as described in 
Section|l|. The protein is equipped with TV contact points, which we here call nodes. 
A node in contact means that the protein has a "correct" conformation on the 
folding pathway, and we say that the node is folded. Due to the fact that a protein 
is a complex 3-dimensional system, a folded node likely has non-local contacts with 
respect to the amino acid sequence in the polypeptide chain. 

Each node is assigned only two energies —eg or |^,|2^. By using the binary 
variables 4>i G {0, 1}, the energy associated to each individual node is written 
Ei = —eg 4'i't'2 ■ ■ ■ 4>i-i4>i ■ The value </>i = means an unfolded node, while 4>i — 1 
is equivalent to a folded node. The product term meets the assumption about a 
folding pathway, because Ei — — eo only if all nodes before i is folded, in addition to 
a folded node i itself. For a system of N nodes the Hamiltonian of the polypeptide 
chain (vacuum energy of the protein) is 

He = -eo(01+0102 + ---+'/'l02---0Ar) . (2) 

Folding of node i is only proper in the case of an unique conformation before this 
folding step. According to Eq.||, every attempt to fold i while j (describing one or 
several nodes < i) is unfolded will not gain energy in the system, because then the 
protein is supposed to be in an energetical unfavorable conformation. 

The unfolded protein has more degrees of freedom relative to the folded, because 
the unfolded polypeptide backbone will have rotational freedom. We incorporate 
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this by assigning each unfolded node / degrees of freedom. The parameter / is 
interpreted as the relative increase in the degrees of freedom for an unfolded node 
compared to a folded node. 



2.2 Water interactions 

Interactions between water and protein surface is important. Proteins are during 
the evolution "designed" to interact with water, simply because they are exposed 
to water in vivo. Makhatadze and Privalov states that in sum hydration ef- 
fects destabilize the native state, and decreasing temperature implies increasing 
destabilizing action. The water that access the hydrophobic protein interior during 
unfolding is supposed to obtain an "ice- like" structure around the apolar surfaces. 
Hence, this structured water will both decrease the entropy and the energy com- 
pared to "free" water j2j] , and thus impacts the thermodynamics of the system. 

Hansen et al. [||,|^ proposed a simple model where each water molecule interacts 
with a node, and not with other water molecules, by a "ladder" of g equidistant 
energies accessible 

-£w + (.9 - 1)<5 

-£,„ + 2(5 (3) 
-I- 5 

which we will also apply in the model considered in this text. The interpretation 
of uji is the energy difference between a "structured" water molecule, associated to 
the unfolded parts of the protein, and a "free" water molecule in the bulk. 

The observable states in a small globular protein is either the unfolded {(pi = 0), 
with water bounded to the surface that uncovers during unfolding of the protein, 
or the folded state (0i • • • (pN = 1) with no water in the protein interior. No inter- 
mediate states are detected for small globular proteins |26|] , hence one cannot know 
for sure how the water enters the protein interior during the unfolding. Hansen 
et al. and Bakk et al. H, ^ have earlier only considered that the amount of 

water interactions increase proportional to the number of unfolded nodes, and with 
that the contact energy of the chain. In this paper we study, as a more speculative 
assumption, the case when a macroscopic contribution of water enters the protein 
surface when the last node is unfolded. 

We note that Eq.^ is the quantized energy levels of a magnetic dipole in an 
external field. In the continuum limit where g ^ oo (with g5 finite), a classical 
magnetic dipole in an external field is obtained, and this again is analogous to 
an electrical dipole in an external electrical field. The dipolar water molecules 
are exposed to an electrical field from the permanent, and induced charges on the 
protein surface, thus Eq. ^jis a representation of that. 

By using the same notation as in Eq. |^, we propose the Hamiltonian that corre- 
sponds to the water-protein interactions 

TYu, = (1 - 01 (/)2 ••• 0w) (wi -1-0^2 H hc^A/) , (4) 

where M is the number of water molecules. The folded protein is a highly ordered 
and dense packed structure where no water can access the interior. Due to Eq.^, 
unfolding of the last node {<j)N =0) implies a less dense pacing of the protein, and 
the cavities are now supposed to be big enough to let water access the interior of the 
protein. The next step, unfolding of node TV — 1, implies likely an even lesser dense 
packing, and allows more water in the protein interior. We assume in this text that 
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the water entering upon unfolding of node iV — 1 , will not interact with the protein 
surface, because it is regarded as a second layer of "free" water. Cohn and Edsall p7[ | 
states that roughly a monolayer of water is bounded to the protein, implying that 
the protein is only interacting with the first monolayer, thus the second, and third 
etc. water layers, successively entering the protein during unfolding, are regarded 
as "free water" . Hence, according to the latter possible (but somewhat speculative) 
explanation of how the water access the apolar interior of the protein, unfolding of 
nodes i < N does not contribute energetical to the water Hamiltonian {Hw) and 
thus not to the thermodynamics. 



2.3 The statistical framework 

The system Hamiltonian (Ti.) describing both chain specific energy (Wc) and water 
interactions (Ti^,) is 

TL = He + Hw = - eo {4>i + 0i02 \- 4>i4'2 ■ ■ ■ 4>n) 

+ (1 - (/>102 • • • 4>n){^1 + + ■ ■ ■ + ^A/) 

Let now Zi be term number i in the partition function which corresponds to folding 
of all nodes < i (pathway assumption), thus 

Z.^r-'e-o^ {^<N) . (6) 

/3 = 1 /T is a rescaled inverse absolute temperature in which the Boltzmann constant 
is absorbed. Zq is the term where all nodes are zero, i.e. a complete unfolded 
protein, while Z^r corresponds to a folded protein. The factor /^^* in Eq.|^ is the 
degrees of freedom in the polypeptide chain that is available in the N — i unfolded 
nodes. e"°'' is the Boltzmann factor from i contact energies — eo in the polypeptide 
chain. The last term in brackets is simply the sum over all distinct levels in one 
water molecule raised to the power of the number of water molecules M bounded 
to the unfolded parts of the protein. We assume that 5/3^1 {i.e. g oo), which 
is equal to an infinite small level spacing in Eq. ^. A first order Taylor expansion of 
the denominator in Eq. ^ yields 

Z.^f"" [j^j e^(^o'3->"/) iz<N) , (7) 

assuming 1 — e~^^^ « 1 when g — > oo. The last term in the partition function (Z^) 
corresponds to a complete folded protein, where there are g*^ degrees of freedom 
from M unbounded water molecules and N contact energies — eo, hence 

Z^ = /^e^^°^ . (8) 

By summing up the Zi terms in Eqs.^ and |^, we obtain the partition function 

N 
i=Q 

where r = e^^^'^-^ , a = eo/{gS) and fi = Ew/eo. The inverse temperature is here 
rescaled by eo /? — > /?. The parameter measures the strength of the water interac- 
tions relative the chain contact energy, thus is interpreted as an effective chemical 
potential. Changing n means adding denaturants, changing pH or salt concentration 
etc. 



[-ir) T^+^ 
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The order parameter n in this system measmes the degree of folding, i.e. the 
mean number of folded nodes divided by N 



N 



\ 13 J 1-r 



n = corresponds to an unfolded protein, while n = 1 is interpreted as a completely 
folded protein. 



3 Calculations and discussion 

The heat capacity is C — ■ {In Z) / d f3^ . Fig.^j shows a typical plot of the heat 
capacity C{T) with three peaks (numbered 1, 2 and 3 from left). These charac- 
teristic peaks corresponds to three critical transition temperatures: Ti, T2 and T3, 
measuring the temperatures at the respective peak maxima. The corresponding 
order parameter n{T) in Fig.||, calculated from Eq.|l^, shows that the protein is 
essential folded for T < Ti and T2 < T < T3, while the protein is nearly unfolded 
in the temperature intervals Ti < T < T2 and T > T3. From this picture it is 
reasonable to state that the physiological temperature interval is between peak 2 
and 3. Accordingly, with reference to this temperature region, we call peak 1 for 
cold folding and peak 2 and 3 respectively for cold and warm unfolding. Peak 2 
and 3 are both observed in experiments [^|ll| and are also seen in the model of 
Hansen et al. [^,^ and Bakk et al. |||,|^. The model considered in this paper has, 
in addition to the cold and warm unfolding, the peculiarity of cold folding. 

Experiments on cold unfolding are very difficult because most proteins unfolds 



below the freezing point of water. Chen and Chellman |11| and Privalov et al. [ p8| 
have all done experiments where the cold unfolding temperature is elevated by 
denaturants, but denaturants make the interpretation of the data very difficult. 
However, Privalov ]lO| did experiments in super cooled water, which is easier to 
interpret. Unfortunately he was not able to detect the sharpness of the cold un- 
folding, and not at all the heat capacity below the cold unfolding. This means that 
our model may predict a cold folding transition at a temperature below the cold 
unfolding transition. 

For temperatures below the cold folding (T 0) analysis of the model gives 
n « 0.99, i.e. only the last node is unfolded, and corresponds to the global energy 
minimum. From Fig.^ it is seen that Ti w 1.45. In Eq.^ the critical 
r = e^/-^^'"-^ = 1, caused by the contact energies of the polypeptide chain, implies 
Ti = l/ln2 w 1.44. Hence, this is nothing but a transition initiated of the chain 
entropy. An increase of the temperature from Ti takes the protein through a nearly 
unfolded state, whereupon the protein folds again at T2 w 1.8. The temperature is 
now so high that the energy of water (Eq.j^) is small (thermal exited) compared to 
the chain contact energy epj thus the protein prefers to fold again. Further increase 
of the temperature causes warm unfolding at « 3.0, because then the entropy of 
the chain again dominates the Gibbs free energy. It is interesting to note that the 
entropy of the chain causes two transitions, the cold folding and warm unfolding. 

We now turn our interest to the sharpness of the transitions, i.e. the value of the 
parameter a in the van't Hoff enthalpy relation (Eq.0). For M = 200 is a ~ 4 both 
for the cold and warm unfolding. This means that the protein is thermodynamically 
regarded as a two-state system that folds in an "all-or-none" manner. Privalov jl^ 
has measured a = 4.0 for the warm unfolding. As far as we know there are no 
experiments on the sharpness of the cold unfolding, but Privalov JTof indicates a 
sharpness for the cold unfolding as well, thus according to our model. The cold 
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folding transition has a w 12. This value is typical for a transition where one has 
small energy differences between the folded/unfolded states and the intermediate 
states. Remember that the "folded" state for T < Ti is actually the first node 
unfolded, thus the unfolding will essential depend on the polypeptide chain with 
the Hamiltonian in Eq.^, which can be shown corresponds to a = 12 ]8|,|20|]. 

Finally we note the consequence of a decreased fi is an increasing separation 
between the cold and warm unfolding as seen in Fig.^. This makes sense because 
a smaller /i is equivalent to a relatively smaller e^, compared to eo (see Eq. ^ , i.e. 
it is less favorable for the protein to be bounded to water. The consequence is that 
the protein prefers to be folded in a larger temperature interval, in where the water 
is expelled to the bulk. However, the transition temperature Ti is not changed 
because this transition is given by the value Ti — 1/ln/. It is also seen that a 
smaller fj, is qualitatively equivalent to a smaller a. 

An increase in M is the same as a decrease in N, because then the water becomes 
more important relative to the chain, and will again allow a broader separation 
between T2 and T3. The broader separation is also seen for a decreasing /, because 
this is equivalent to a larger M. 

Further increase of fi will eventually merge peak 1 and 2. It is interesting to 
note that a = 4 for the merged peaks, because then the transition is energetically 
dominated of the M water molecules which caused the transition at T2 in Fig.^. 

In sum the qualitative change from Fig.|| to Fig. ^, by a decreasing /z, is also 
obtained by an decrease of a, f 01 N or an increase of M. 



4 Conclusion 

We have in this paper studied a protein model with water interactions. The model 
is based on earlier models by Hansen et al. ||,^ and Bakk et al. |||,||. In contrast to 
these similar models, where the water amount was supposed to increase linearly to 
the degree of unfolding of the polypeptide chain, we have, with a more speculative 
assumption, studied the situation where a macroscopic amount of water access the 
protein interior during unfolding of the last node is the only contribution to the 
water-protein Hamiltonian. 

With reference to physiological temperatures we find that the protein exhibits 
cold and warm unfolding transitions, which is an experimental fact ||l^,|ll[. These 
transitions are associated by a sharpness indicating, from at thermodynamical point 
of view, a two-state system, which is also experimental established |p^-p^. Decreas- 
ing the temperature further below the cold unfolding region the protein folds again 
(cold folding). This folding, caused by the chain entropy, has a less sharp transition, 
which corresponds to a transition where the intermediate folding energies does not 
differ significant from the folded/unfolded energies. In sum the model exhibits three 
unfolding/folding transitions. 

It is interesting to note that both the cold folding and the warm unfolding is due 
to the polypeptide chain entropy. 
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Figure captions 



Fig. 1. Schematic illustration of the heat capacity around an unfolding transition 
showing the parameters in the van't Hoff enthalpy relation (Eq. |l|) . Tc is the transi- 
tion temperature, Q (area of the peak) is the released energy (latent heat) and AC 
is the peak height of the transition. 

Fig. 2. Heat capacity C{T) with the parameters a = 0.077, ^ = 3.3, / = 2, 
and N = M = 200, showing three peaks. With reference to the temperature region 
between peak 2 and 3 (physiological temperatures) we call the transitions: 1) cold 
folding, 2) cold unfolding and 3) warm unfolding. 

Fig. 3. The corresponding order parameter n{T) to Fig.^, describing the degree of 
folding. The chosen parameters are as in Fig. ^ n = corresponds to an unfolded 
protein, while n = 1 is interpreted as a completely folded protein. 

Fig. 4. Heat capacity where the effective chemical potential is — 3.2, slightly 
decreased from the value in Fig.^ (/j, = 3.3). All other parameters are chosen as in 
Fig. 1^. The qualitative picture is a broader separation between T2 and compared 
to Fig.|. 
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Figure 2: 
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Figure 4: 
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